12-01-' 06 13:54 FROM- 412-741-9292 T-357 P005/011 F-791 

Atty. Docket No. YOR920Q00390US1 

(590.023) 

Amendments to the Claims : 

This listing of claims will replace all prior versions, and listings, of claims in the 
application. 

Listing of Claims : 

1-15, (Cancelled) 

16. (Currently Amended) The method according to Claim 3. A method of providing 
speaker adaptation in speech recognition, said method comprising the steps of: 

providing at least one speech recognition model; 

accepting speaker data; 

generating a word lattice having a plurality of paths based on the speaker data, wherein 
the step of generating the word lattice comprises considering language model probabilities by 
incorporating the language model probabilities into a transition probability: and 

adapting at teflSt ore of the spsalrer dfrfo qiid litre at least Que spegfih repQgwtiQDl HlQ<fel 
with respect to the generated word lattice in a manner to maximize the likelihood of the speaker 
data, 

wherein said step of generating a word lattice comprises generating a maximum a- 
posteriori probability word lattice. 

wherein said step of generating a maximum a-posteriori probability word lattice 
comprises: 
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determining posterior state occupancy probabilities for each state in the speaker 
data at each time: 

determining posterior word occupancy probabilities bv summing over all states 
interior to each word in the speaker data: and 

determining at least one likeliest word at each frame of the speaker data, 

wherein said step of determining posterior state occupancy probabilities fox each state in 
the speaker data at each time comprises the use of the following formula: 

where = P(yi, S t =s) and fl/ = P(yt+f I S t -s) for states s and a set of observations T, and 
where y t represents T observation frames of adaptation data. 

17. (Previously Presented) The method according to Claim 16, wherein said step of 
determining posterior word occupancy probabilities by summing over all states interior to each 
word in the speaker data comprises a determination using the following formula at each time 
frame: 



where w f is the set of states in word Wj. 
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18. (Currently Amended) Th e m e thod according to Claim 10, An apparatus for 
providing speaker adaptation in speech recognition, said apparatus comprising; 

at least one speech recognition model: 

an accepting arrangement which accepts speaker data: 

a lattice generator which generates a word lattice having a plurality of paths based on the 
speaker data, wherein the generation of the word lattice comprises consideration of language 
model probabilities by incorporating the language model probabilities into a transition 
probability: and 

a processing arrangement which adapts at least one of the speaker data and the at least 
one speech recognition model with respect to the generated word lattice in a manner to maximize 
the likelihood of the speaker data. 

wherein said generator is adapted to generate a maximum a-posteriori probability word 

lattice n 

wherein said generator is adapted to: 

determine posterior state occupancy probabilities for each state in the speaker data 
at each time: 

determine posterior word occupancy probabilities by summing over all states 
interior to each word in the speaker data: and 

determine at least one likeliest word at each frame of the speaker data. 
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wherein said determining posterior state occupancy probabilities for each state in the 
speaker data at each time comprises the use of the following formula: 

where ai = P(yi', S,=s) and 0} = P(y,+i T I S,=s) for states s and a set of observations T, and 
where y, r represents T observation frames of adaptation data. 
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19. (Previously Presented) The apparatus according to Claim 18* wherein said 
determining posterior word occupancy probabilities by summing over all states interior to each 
word in the speaker data comprises a determination using the following formula at each time 
frame: 

y 

where w f is the set of states in word WJ. 
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